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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 866 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: peptide 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: internal fragment 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1 

Met Val Asn Ser Ser Arg Val Gin Pro Gin 
15 10 

Gin Pro Gly Asp Ala Lys Arg Pro Pro Ala 

15 20 

Pro Arg Ala Pro Asp Pro Gly Arg Leu Met 

25 30 

Ala Gly Cys Ala Ala Val Gly Ala Ser Leu 

35 40 

Ala Ala Pro Gly Gly Leu Cys Glu Gin Arg 

45 50 

Gly Leu Glu lie Glu Met Gin Arg lie Arg 

55 60 

Gin Ala Ala Ala Arg Asp Pro Pro Ala Gly 

65 70 

Ala Ala Ala Ser Pro Ser Pro Pro Leu Ser 

75 80 

Ser Cys Ser Arg Gin Ala Trp Ser Arg Asp 

85 90 
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Asn Pro Gly Glu Glu Glu Ala Glu Glu Glu 

95 100 

Glu Glu Glu Val Glu Gly Glu Glu Gly Gly 

105 110 

Met Val Val Glu Met Asp Val Glu Trp Arg 

115 120 

Pro Gly Ser Arg Arg Ser Ala Ala Ser Ser 

125 130 

Ala Val Ser Ser Val Gly Ala Arg Ser Arg 

135 140 

Gly Leu Gly Gly Tyr His Gly Ala Gly His 

145 150 

Pro Ser Gly Arg Arg Arg Arg Arg Glu Asp 

155 160 

Gin Gly Pro Pro Cys Pro Ser Pro Val Gly 

165 170 

Gly Gly Asp Pro Leu His Arg His Leu Pro 

175 180 

Leu Glu Gly Gin Pro Pro Arg Val Ala Trp 

185 190 

Ala Glu Arg Leu Val Arg Gly Leu Arg Gly 

195 200 

Leu Trp Gly Thr Arg Leu Met Glu Glu Ser 

205 210 

Ser Thr Asn Arg Glu Lys Tyr Leu Lys Ser 

215 220 



Val Leu Arg Glu Leu Val Thr Tyr Leu Leu 

225 230 
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Phe Leu lie Val Leu Cys lie Leu Thr Tyr 

235 240 

Gly Thr Glu Ala Asp Asn Arg Ser Phe lie 

245 250 

Phe Tyr Glu Asn Leu Leu Leu Gly Val Pro 

255 260 

Arg lie Arg Gin Leu Arg Val Arg Asn Gly 

265 270 

Ser Cys Ser lie Pro Gin Asp Leu Arg Asp 

275 280 

Glu lie Lys Glu Cys Tyr Asp Val Tyr Glu 

285 290 

Thr Ala Ala Gin Val Ala Ser Leu Lys Lys 

295 300 

Asn Val Trp Leu Asp Arg Gly Thr Arg Ala 

305 310 

Thr Phe lie Asp Phe Ser Val Tyr Asn Ala 

315 320 

Asn lie Asn Leu Phe Cys Val Val Arg Leu 

325 330 

Leu Val Glu Phe Pro Ala Thr Gly Gly Val 

335 340 

lie Pro Ser Trp Gin Phe Gin Pro Leu Lys 

345 350 

Leu lie Arg Tyr Val Thr Thr Phe Asp Phe 

355 360 



Phe Leu Ala Ala Cys Glu He He Phe Cys 

365 370 
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Phe Phe He Phe Tyr Tyr Val Val Glu Glu 

375 380 

He Leu Glu Xaa He Arg He His Lys Leu 

385 390 

His Tyr Phe Arg Xaa Ser Phe Trp Asn Cys 

395 400 

Leu Asp Val Val He Val Val Leu Ser Val 

405 410 

Val Ala He Gly He Asn He Tyr Arg Thr 

415 420 

Ser Asn Val Glu Val Xaa Leu Leu Gin Phe 

425 430 

Leu Xaa Glu Asp Gin Asn Thr Phe Pro Asn 

435 440 

Phe Glu His Leu Ala Tyr Trp Gin He Gin 

445 450 

Phe Asn Asn He Ala Ala Val Thr Val Phe 

455 460 

Phe Val Trp He Lys Leu Phe Lys Phe He 

465 470 

Asn Phe Asn Arg Thr Met Ser Gin Leu Ser 

475 480 

Thr Thr Met Ser Arg Cys Ala Lys Asp Leu 

485 490 

Phe Gly Phe Ala He Met Phe Phe He He 

495 500 

Phe Leu Ala Tyr Ala Gin Leu Ala Tyr Leu 

505 510 
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Val Phe Gly Thr Gin Val Asp Asp Phe Ser 

515 520 

Thr Phe Gin Glu Cys lie Phe Thr Gin Phe 

525 530 

Arg lie lie Leu Gly Asp lie Asn Phe Ala 

535 540 

Glu lie Glu Glu Ala Asn Xaa Arg Val Leu 

545 550. 

Gly Pro lie Tyr Phe Thr Thr Phe Val Phe 

555 560 

Phe Met Phe Phe lie Leu Leu Asn Met Phe 

565 570 

Leu Ala lie lie Asn Asp Thr Tyr Ser Glu 

575 580 

Val Lys Ser Asp Leu Xaa Xaa Xaa Ala Gin 

585 590 

Gin Lys Ala Glu Met Glu Leu Ser Asp Leu 

595 600 

lie Arg Lys Gly Tyr His Lys Ala Leu Val 

605 610 

Lys Leu Lys Leu Lys Lys Asn Thr Val Asp 

615 620 

Asp lie Ser Glu Ser Leu Arg Gin Gly Gly 

625 630 

Gly Lys Leu Asn Phe Asp Glu Leu Arg Gin 

635 640 



Asp Leu Lys Gly Lys Gly His Thr Asp Ala 

645 650 
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Glu lie Glu Ala lie Phe Thr Lys Tyr Asp 

655 660 

Gin Asp Gly Asp Gin Glu Leu Thr Glu His 

665 670 

Glu His Gin Gin Met Arg Asp Asp Leu Glu 

675 680 

Lys Glu Arg Glu Asp Leu Asp Leu Asp His 

685 690 

Ser Ser Leu Pro Arg Pro Met Ser Ser Arg 

695 700 

Ser Phe Pro Arg Ser Leu Asp Asp Ser Glu 

705 710 

Glu Asp Asp Asp Glu Asp Ser Gly His Ser 

715 720 

Ser Arg Arg Arg Gly Ser lie Ser Ser Gly 

725 730 

Val Ser Tyr Glu Glu Phe Gin Val Leu Val 

735 740 

Arg Arg Val Asp Arg Met Glu His Ser lie 

745 750 

Gly Ser lie Val Ser Lys lie Asp Ala Val 

755 760 

He Val Lys Leu Glu He Met Glu Arg Ala 

765 770 

Lys Leu Lys Arg Arg Glu Val Leu Gly Arg 

775 780 



Leu Leu Asp Gly Val Ala Glu Asp Glu Arg 

785 790 
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Leu Gly Arg Asp Ser Glu lie His Arg Glu 

795 800 

Gin Met Glu Arg Leu Val Arg Glu Glu Leu 

805 810 

Glu Arg Trp Glu Ser Asp Asp Ala Ala Ser 

815 820 

Gin lie Ser His Gly Leu Gly Thr Pro Val 

825 830 

4 

Gly Leu Asn Gly Gin Pro Arg Pro Arg Ser 

835 840 

Ser Arg Pro Ser Ser Ser Gin Ser Xaa Thr 

845 850 

Glu Gly Met Glu Gly Ala Gly Gly Asn Gly 

855 860 



Ser Ser Asn Val His Val 

865 



(3) INFORMATION FOR SEQ ID NO: 2 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 523 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: peptide 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: internal fragment 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 
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Phe Leu Ala Lys Glu Glu Ala Arg Lys Val 
15 10 

Lys Arg Leu His Gly Met Leu Arg Ser Leu 

15 20 

Leu Val Tyr Met Leu Phe Leu Leu Val Thr 

25 30 

Leu Leu Ala Ser Tyr Gly Asp Ala Ser Cys 

35 40 

His Gly His Ala Tyr Xaa Arg Leu Gin Ser 

45 50 

Xaa Xaa Xaa Xaa Xaa Ala lie Lys Gin Glu 

55 60 

Leu His Ser Arg Ala Phe Leu Ala lie Thr 

65 70 

Arg Ser Glu Glu Leu Trp Pro Trp Met Ala 

75 80 

His Val Leu Leu Pro Tyr Val His Xaa Xaa 

85 90 

Xaa Xaa Xaa Gly Asn Gin Ser Ser Pro Glu 

95 100 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

105 110 

Xaa Leu Gly Pro Pro Arg Leu Arg Gin Val 

115 120 

Arg Leu Gin Glu Ala Leu Tyr Pro Asp Pro 

125 130 

Pro Gly Pro Arg Val His Thr Cys Ser Ala 

135 140 
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Ala Gly Gly Phe Ser Thr Ser Asp Tyr Asp 

145 150 

Val Gly Trp Glu Ser Pro His Asn Gly Ser 

155 160 

Gly Thr Trp Ala Thr Xaa Xaa Ser Ala Pro 

165 170 

Asp Leu Leu Gly Ala Trp Ser Trp Gly Ser 

175 180 

Cys Ala Val Tyr Asp Ser Gly Gly Tyr Val 

185 190 

Gin Glu Leu Gly Leu Ser Leu Glu Glu Ser 

195 200 

Arg Asp Arg Leu Arg Phe Leu Gin Leu His 

205 210 

Asn Trp Leu Asp Asn Arg Ser Arg Ala Val 

215 220 

Phe Leu Glu Leu Thr Arg Tyr Ser Pro Ala 

225 230 

Val Gly Leu His Ala Ala Val Thr Leu Arg 

235 240 

Leu Glu Phe Pro Ala Ala Gly Arg Ala Leu 

245 250 

Ala Ala Leu Ser Val Arg Pro Phe Ala Leu 

255 260 

Arg Arg Leu Ser Ala Gly Leu Ser Leu Pro 

265 270 



Leu Leu Thr Ser Val Cys Leu Leu Leu Phe 

275 280 
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Ala Val His Phe Ala Val Ala Glu Ala Arg 

285 290 

Thr Trp Xaa His Arg Glu Gly Arg Trp Arg 

295 300 

Val Leu Arg Leu Gly Ala Trp Ala Arg Trp 

305 310 

Leu Leu Val Ala Leu Thr Ala Ala Thr Ala 

315 320 

Leu Val Arg Leu Ala Gin Leu Gly Ala Ala 

325 330 

Asp Arg Gin Xaa Xaa Trp Thr Arg Phe Val 

335 340 

Arg Gly Arg Pro Arg Arg Phe Thr Ser Phe 

345 350 

Asp Gin Val Ala Gin Leu Ser Ser Ala Ala 

355 360 

Arg Gly Leu Ala Ala Ser Leu Leu Phe Leu 

365 370 

Leu Leu Val Lys Ala Ala Gin Gin Leu Arg 

375 380 

Phe Val Arg Gin Trp Ser Val Phe Gly Lys 

385 390 

Thr Leu Cys Arg Ala Leu Pro Glu Leu Leu 

395 400 

Gly Val Thr Leu Gly Leu Val Val Leu Gly 

405 410 



Val Ala Tyr Ala Gin Leu Ala lie Leu Leu 

415 420 
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Val Ser Ser Cys Val Asp Ser Leu Trp Ser 

425 430 

Val Ala Gin Ala Leu Leu Xaa Xaa Xaa Xaa 

435 440 

Val Leu Cys Pro Gly Thr Gly Leu Ser Thr 

445 450 

Leu Cys Pro Ala Glu Ser Trp His Leu Ser 

455 460 

Pro Leu Leu Cys Val Gly Leu Trp Ala Leu 

465 470 

Arg Leu Trp Gly Ala Leu Arg Leu Gly Ala 

475 480 

Val lie Leu Arg Trp Arg Tyr His Ala Leu 

485 490 

Arg Gly Glu Leu Tyr Arg Pro Ala Trp Glu 

495 500 

Pro Gin Asp Tyr Glu Met Val Glu Leu Phe 

505 510 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Leu Arg Arg 

515 520 

Leu Arg Leu 



(4) INFORMATION FOR SEQ ID NO : 3 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 99 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: 
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(A) DESCRIPTION: peptide 
(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: internal fragmen" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 



Glu Asn Arg Lys Met Arg Asp Glu Gin Leu 

15 10 

Phe lie Thr lie Arg Asp Met Leu Cys Phe 

15 20 

Phe Ala Ser Leu Tyr lie Met Val Met Leu 

25 30 

Thr Tyr Tyr Cys Lys Asp Arg His Gly Tyr 

35 40 

Trp Tyr Gin Leu Glu Met Ser Thr lie Leu 

45 50 

Asn lie Asn Gin Lys Asn Tyr Gly Asp Asn 

55 60 

Thr Xaa Phe Met Ser lie Gin His Ala Asp 

65 70 

Asp Phe Trp Asp Trp Ala Arg Glu Ser Leu 

75 80 

Ala Thr Ala Leu Leu Ala Ser Trp Tyr Asp 

85 90 

Gly Asn Pro Ala Tyr Gly Met Arg Ala Tyr 

95 100 

Met Asn Asp Lys Val Ser Arg Ser Met Gly 

105 110 
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He Gly Thr He Arg Gin Val Arg Thr Lys 

115 120 

Lys Ser Glu He He Thr Leu Phe Asn Lys 

125 130 

Leu Asp Ser Glu Arg Trp He Asp Asp His 

135 140 

Thr Arg Ala Val He He Glu Phe Ser Ala 

145 150 

Tyr Asn Ala Gin He Asn Tyr Phe Ser Val 

155 150 

Val Gin Leu Leu Val Glu He Pro Lys Ser 

165 170 

Gly He Tyr Leu Pro Asn Ser Trp Val Glu 

175 180 

Ser Val Arg Leu lie Lys Ser Glu Gly Ser 

185 190 

Asp Gly Thr Val Val Lys Tyr Tyr Glu Met 

195 200 

Leu Tyr He Phe Phe Ser Val Leu He Phe 

205 210 

Val Lys Glu He Val Trp Asn Phe Met Asp 

215 220 

Leu He Val Gly Ala Leu Ala Val Ala Ser 

225 230 



Val Leu Ala Tyr Thr He Arg Gin Arg Thr 

235 240 



Thr Asn Arg Ala Met Glu Asp Phe Asn Ala 

245 250 
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Asn Asn Gly Asn Ser Tyr lie Asn Leu Thr 

255 260 

Glu Gin Arg Asn Trp Glu He Val Phe Ser 

265 270 

Tyr Cys Leu Ala Gly Ala Val Phe Phe Thr 

275 280 

Ser Cys Lys Met He Arg He Leu Arg Phe 

285 290 

Asn Arg Arg He Gly Val Leu Ala Ala Thr 

295 300 

Leu Asp Asn Ala Leu Gly Ala He Val Ser 

305 310 

Phe Gly lie Ala Phe Leu Phe Phe Ser Met 

315 320 

Thr Phe Asn Ser Val Leu Tyr Ala Val Leu 

325 330 

Gly Asn Lys Met Gly Gly Tyr Arg Ser Leu 

335 340 

Met Ala Thr Phe Gin Thr Ala Leu Ala Gly 

345 350 

Met Leu Gly Lys Leu Asp Val Thr Ser He 

355 360 

Gin Pro Xaa Xaa Xaa Xaa Xaa He Ser Gin 

365 370 

Phe Ala Phe Val Val He Met Leu Tyr Met 

375 380 



He Glu Phe Glu Glu He Arg Asn Asp Ser 

385 390 
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Glu Lys Gin Thr Asn Asp Tyr Glu lie 

395 



(5) INFORMATION FOR SEQ ID NO: 4 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 63 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: peptide 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: internal fragment 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 



Phe Thr Met Val Phe Ser Leu Glu Cys Val 
15 10 

Leu Lys Val lie Ala Phe Gly Phe Leu Asn 

15 20 

Tyr Phe Arg Xaa Asp Thr Trp Asn lie Phe 

25 30 

Asp Phe lie Thr Val He Gly Ser He Thr 

35 40 

Glu He He Leu Thr Asp Ser Lys Leu Val 

45 50 

Asn Thr Ser Gly Xaa Phe Xaa Xaa Xaa Xaa 

55 60 



Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

65 70 



11939.1 



i 



-36- 



Xaa Asn Met Ser Phe Leu Lys Xaa Xaa Xaa 

75 * 80 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Leu Phe 

85 90 

Arg Ala Ala Arg Leu lie Lys Leu Leu Arg 

95 100 

Gin Gly Tyr Thr lie Arg He Leu Leu Trp 

105 110 

Thr Phe Val Gin Ser Phe Lys Ala Leu Pro 

115 120 

Tyr Val Cys Leu Leu He Ala Met Leu Phe 

125 130 

Phe He Tyr Ala He He Gly Met Gin Val 

135 140 

Phe Gly Asn Asn Phe Arg Ser Phe Phe Gly 

145 150 

Ser Leu Met Leu Leu Phe Arg Ser Ala Thr 

155 160 

Gly Glu Xaa Ala Trp Gin Glu He Glu Arg 

165 170 

Cys Gly Xaa Thr Asp Leu Ala Tyr Val Tyr 

175 18Q 

Phe Val Ser Phe lie Phe Phe Cys Ser Phe 

185 190 

Leu Met Leu Asn Leu Phe Val Ala Val lie 

195 200 



Met Asp Asn Phe Glu Tyr Leu Thr Arg Asp 

205 210 
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Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

215 220 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

225 230 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

235 240 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

245 250 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

255 260 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa lie Leu Gly 

265 270 

Pro His His Leu Asp Xaa Glu Phe Val Arg 

275 280 

Val Trp Ala Glu Tyr Asp Arg Ala Ala Cys 

285 290 

Gly Arg lie His Tyr Thr Glu Met Tyr Glu 

295 300 

Met Glu Arg Arg Arg Ser Lys Glu Arg Lys 

305 310 

His Leu Leu Ser Pro Asp Val Ser Arg Cys 

315 320 

Asn Ser Glu Glu Arg Gly Thr Gin Ala Asp 

325 330 

Trp Glu Ser Pro Glu Arg Arg Gin Ser Arg 

335 340 



Ser Pro Ser Glu Gly Arg Ser Gin Thr Pro 

345 350 
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Asn Arg Gin Gly Thr Gly Ser Leu Ser Glu 

355 360 



Ser Ser lie 



(4) INFORMATION FOR SEQ ID NO: 5 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: peptide 

(iii) HYPOTHETICAL: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5 

Glu lie Ala lie Phe Thr Lys Tyr Asp Gin 
15 10 

Asp Gly Asp Gin Glu Leu Thr Glu His Glu 

15 20 

His Gin Gin Met Arg Asp Asp Leu 

25 



(5) INFORMATION FOR SEQ ID NO: 6 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5057 nucleic acids 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: genomic DNA 
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(iii) HYPOTHETICAL: NO 

(xi) SEQUENCE DESCRIPTION: 

GGCTCCTGAG GCGCACAGCG CCGAGCGCGG 
GACGCCAGTG ACCGCGATGG TGAACTCCAG 
CCGGGGACGC CAAGCGGCCG CCCGCGCCCC 
CTGATGGCTG GCTGCGCGGC CGTGGGCGCC 
CCTCTGCGAG CAGCGGGGCC TGGAGATCGA 
CGGCCGCGCG GGACCCCCCG GCCGGAGCCG 
CTCTCGTCGT GCTCCCGGCA GGCGTGGAGC 
GGCCGAGGAG GAGGAGGAGG AGGTGGAAGG 
TGGAGATGGA CGTAGAGTGG CGCCCGGGCA 
TCGGCCGTGA GCTCCGTGGG CGCGCGGAGC 
CGGCGCGGGC CACCCGAGCG GGAGGCGGCG 
CGCCGTGCCC CAGCCCAGTC GGCGGCGGGG 
CCCCTGGAAG GGCAGCCGCC CCGAGTGGCC 
CGGGCTGCGA GGTCTCTGGG GAACAAGACT 
ACCGAGAGAA ATACCTTAAA AGTGTTTTAC 
CTTTTTCTCA TAGTCTTGTG CATCTTGACC 
TGTGTACTAC TACACCCGGA TGATGTCACA 
TGTCCAAAAC GGAGAAAACT AACTTTAAAA 
TTCTGGAAGT TCACAGAAGG CTCCTTATTG 
GCAGCCCAGC AACCAGACTG AAGCTGACAA 
AGAACCTGCT GTTAGGGGTT CCACGAATAC 
GGATCCTGCT CTATCCCCCA GGACTTGAGA 
TGATGTCTAC TCTGTCAGTA GTGAAGATAG 
ATGGAACCGC TTGGATCTAC ACAAGTGAAA 
CACTGGGGAA TCATTGCAAC TTATAGTGGA 
GTCAAGAACA AGAGAGGAAA CAGCTGCACA 
ATGTCTGGCT GGACCGAGGA ACCAGGGCAA 
TACAACGCCA ACATTAACCT GTTCTGTGTG 
CCCAGCAACA GGTGGTGTGA TTCCATCTTG 
TGATCCGATA TGTCACAACT TTTGATTTCT 
ATCTTTTGTT TCTTTATCTT TTACTATGTG 
TCGCATTCAC AAACTACACT ATTTCAGGAG 
TTGTGATCGT TGTGCTGTCA GTGGTAGCTA 
ACATCAAATG TGGAGGTGCT ACTACAGTTT 
CCCCAACTTT GAGCATCTGG CATATTGGCA 
CTGCTGTCAC AGTATTTTTT GTCTGGATTA 
TTTAACAGGA CCATGAGCCA GCTCTCGACA 
AGACCTGTTT GGCTTTGCTA TTATGTTCTT 



SEQ ID NO: 


6 




CGCCGCGCAC 


CCGCGCGCCG 


50 


TCGCGTGCAG 


CCTCAGCAGC 


100 


GCGCGCCGGA 


CCCGGGCCGG 


150 


AGCCTCGCCG 


CCCCGGGCGG 


200 


GATGCAGCGC 


ATCCGGCAGG 


250 


CGGCCTCCCC 


TTCTCCTCCG 


300 


CGCGATAACC 


CCGGCTTCGA 


350 


GGAAGAAGGC 


GGAATGGTGG 


400 


GCCGGAGGTC 


GGCCGCCTCC 


450 


CGGGGGCTTG 


GGGGCTACCA 


500 


CCGGCGAGAG 


GACCAGGGCC 


550 


ACCCGCTGCA 


TCGCCACCTC 


600 


TGGGCGGAGA 


GGCTGGTTCG 


650 


CATGGAGGAA 


AGCAGCACTA 


700 


GGGAACTGGT 


CACATACCTC 


750 


TACGGCATGA 


TGAGCTCCAA 


800 


GCTCTTCCTA 


GACACCCCCG 


850 


CTCTGTCTTC 


CATGGAAGAC 


900 


GATGGGCTGT 


ACTGGAAGAT 


950 


CCGAAGTTTC 


ATCTTCTATG 


1000 


GGCAACTCCG 


AGTCAGAAAT 


1050 


GATGAAATTA 


AAGAGTGCTA 


1100 


GGCTCCCTTT 


GGGCCCCGAA 


1150 


AAGACTTGAA 


TGGTAGTAGC 


1200 


GCTGGCTATT 


ATCTGGATTT 


1250 


AGTTGCTAGC 


CTCAAGAAAA 


1300 


CTTTTATTGA 


CTTCTCAGTG 


1350 


GTCAGGTTAT 


TGGTTGAATT 


1400 


GCAATTTCAG 


CCTTTAAAGC 


1450 


TCCTGGCAGC 


CTGTGAGATT 


1500 


GTGGAAGAGA 


TATTGGAAAT 


1550 


TTTCTGGAAT 


TGTCTGGATG 


1500 


TAGGAATTAA 


CATATACAGA 


1650 


CTGGAAGATC 


AAAATACTTT 


1700 


GATACAGTTC 


AACAATATAG 


1750 


AGCTCTTCAA 


ATTCATCAAT 


• 1800 


ACCATGTCTC 


GATGTGCCAA 


1850 


CATTATTTTC 


CTAGCGTATG 


1900 
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CTCAGTTGGC ATACCTTGTC TTTGGCACTC 
TTCCAAGAGT GTATCTTCAC TCAATTCCGT 
CTTTGCAGAG ATTGAGGAAG CTAATCGAGT 
CTACATTTGT GTTCTTTATG TTCTTCATTC 
ATCATCAATG ATACTTACTC TGAAGTGAAA 
AGCTGAAATG GAACTGTCAG ATCTTATCAG 
TGGTCAAACT AAAACTGAAA AAAAATACCG 
CTGCGGCAAG GAGGAGGCAA GTTAAACTTT 
CAAAGGGAAG GGCCATACTG ATGCAGAGAT 
ACGACCAAGA TGGAGACCAA GAACTGACCG 
AGAGACGACT TGGAGAAAGA GAGGGAGGAC 
TTTACCACGT CCCATGAGCA GCCGAAGTTT 
CTGAGGAGGA TGACGATGAA GATAGCGGAC 
AGCATTTCTA GTGGCGTTTC TTACGAAGAG 
AGTGGACCGG ATGGAGCATT CCATCGGCAG 
CCGTGATCGT GAAGCTAGAG ATTATGGAGC 
GAGGTGCTGG GAAGGCTGTT GGATGGGGTG 
TCGTGACAGT GAAATCCATA GGGAACAGAT 
AGTTGGAACG CTGGGAATCC GATGATGCAG 
TTAGGCACGC CAGTGGGACT AAATGGTCAA 
CCCATCTTCC TCCCAATCTA CAGAAGGCAT 
GGAGTTCTAA TGTCCACGTA TGATATGTGT 
AATAAGTGAG GAAGTGGCTG TCCTGAATTG 
ATATGCCCTG ACCACCATAG GATGCTAGTC 
TTCTGCACTT TAATTTATTT TATATAAACT 
TTTTTTTCTT TTTCTCATAT AAGAAATCTA 
AAAAAAAATC TTCATGATGT GTATTGAGCG 
GACTGAGTCT TCTCAGTTGA CAATGAAGTA 
TGTCAAAGGG CTTCTGAGTT TCATTTCCAG 
ATTTTTTTCC AAGAGTGTGA AGGAAAATGG 
GCATAGTTCA TGAGCTTAAT ACATAGCTTT 
TTTTTCAACT AGCTTCCTGG GGTAAACTTT 
GGAACTCCAA ACTATGATAG AATCTGTGTG 
AATACTATGC TTTTTTGTAA GTTGATCGTA 
CTGTATCACT TAATTTTTAC CTTATTTTGG 
GTAAAACCAC CAAAGATCAG TTATAGGATA 
ACACAGGAGA ATTGGAAGGA GCCCTAAGTT 
TTAATGGTTA GTTTAGCCTA AAGATTTATC 
TGGCTCTACT CATTTGCAAC TGAATTTAAT 
ACCAACTTAC TAAATTTTTA GTATGCACTG 
ATGTTCATTT TAAGCAAAAT TTTAAGAAAG 
TTGGTTTTAA ACTATTTTAA GAATATAGTA 
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AGGTCGATGA 


CTTCAGTACT 


1950 


ATCATTTTGG 


GCGATATCAA 


2000 


TTTGGGACCA 


ATTTATTTCA 


2050 


TTTTGAATAT 


GTTTTTGGCT 


2100 


TCTGACTTGG 


CACAGCAGAA 


2150 


AAAGGGCTAC 


CATAAAGCTT 


2200 


TGGATGACAT 


TTCAGAGAGT 


2250 


GACGAACTTC 


GACAAGATCT 


2300 


TGAGGCAATA 


TTCACAAAGT 


2350 


AACATGAACA 


TCAGCAGATG 


2400 


CTGGATTTGG 


ATCACAGTTC 


2450 


CCCTCGAAGC 


CTGGATGACT 


2500 


ATAGCTCCAG 


AAGGAGGGGA 


2550 


TTTCAAGTCC 


TGGTGAGACG 


2600 


CATAGTGTCC 


AAGATTGACG 


2650 


GAGCCAAACT 


GAAGAGGAGG 


2700 


GCCGAGGATG 


AAAGGCTGGG 


2750 


GGAACGGCTA 


GTACGTGAAG 


2800 


CTTCCCAGAT 


CAGTCATGGT 


2850 


CCTCGCCCCA 


GAAGCTCCCG 


2900 


GGAAGGTGCA 


GGTGGAAATG 


2950 


GTTTCAGTAT 


GTGTGTTTCT 


3000 


CTGTAACAAG 


CACACTATTT 


3050 


TTTGTGACCG 


ATTGCTAATC 


3100 


TTACCCATGG 


TTCAAAGATT 


3150 


GGTGTAAATA 


TTGAGTACAG 


3200 


GTACGCCCAG 


TTGCCACCAT 


3250 


GCCTTTTAAA 


GCTAGAAAAC 


3300 


TCACAAAAAT 


CAGTATTGTT 


3350 


GGCAATTCCT 


TTCCACTCTG 


3400 


CTTTTAAGAA 


AGGAGCCTTT 


3450 


TCTAAAAGAT 


AAAATGGGAA 


3500 


AATGGTTAAG 


ATGAATGTTA 


3550 


TCTGATGTCT 


GTGGGACTAA 


3600 


CTCTAATTTG 


AATAAGCTGA 


3650 


AAATGGCATC 


TCTAACCATA 


3700 


GTCACTCAGT 


TTAATTTCTT 


3750 


TGCATATTCT 


TTTTCCCATG 


3800 


GTTATAACTC 


ATCTAGTGAG 


3850 


AAAGTTTTTA 


TCCAACAATT 


3 900 


TTTTGAAATT 


CATAAAGCAT 


3950 


CTCGGTCAGG 


TATGNNNCAC 


4000 
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GCCTGTAATC CCAGCACTTT GGGAGGCCGA AACAGGCGAA TCACTTGAGC 4 050 

CCAGGAGTTC AAGACCAACA TGGGCAATGT GGCGAAACTC CATCTCTACA 4100 

AAAAATGCAA AAATAAAAAA TATAGTACTC AAGTATTCTT GATCCTGTGT 4150 

TTCAAAACTA GAATTTGTAA TGCAAATGGA GCTCAGTCTA ATAAAAAAGA 4 200 

GGTTTTGGTA TTAAAAGTTC ATACATTAGA CAGTATCAGC CAAAATTTGA 4 250 

GTTAGCAACA CTGTTTTCTT TACGAGAGGG TCTCACCCAA ATTTATGGGG 4 300 

AGAAATCTAT TTCTCAAAAA AAAAAAATCT TCTTTTACAG AAATGTTGAG 4 350 

TAAGGTGACA TTTTGAGCGC TAATAAGCAA AAGAGCATGC AGTGCTGTTG 44 00 

AATAACCCTC ACTTGGAGAA CCAAGAGAAT CCTGTCGTTT AATGCTATAT 4 4 50 

TTTAATTTCA CAAGTTGTTC ATTTAACTGG TAGAATGTCA GTCCAATCTC 4 500 

CAATGAGAAC ATGAGCAAAT AGACCTTTCC AGGTTGAAAG TGAAACATAC 4 550 

TGGGTTTCTG TAAGTTTTTC CTCATGGCTT CATCTCTATC TTTACTTTCT 4 600 

CTTGAATATG CTACACAAAG TTCTTTATTA CTACATACTA AAGTTTGCAT 4 650 

TCCAGGGATA TTGACTGTAC ATATTTATGT ATATGTACCA TGTTGTTACA 4 700 

TGTAAACAAA CTTCAATTTG AAGTGCAGCT ATTATGTGGT ATCCATGTGT 4750 

ATCGACCATG TGCCATATAT CAATTATGGT CACTAGAAAG TCTCTTTATG 4 800 

ATACTTTTTA TTGTACTGTT TTTCATTTCA CTTGCAAAAT TTTGCAGAAT 4 850 

TCCTCCTTTC TACCCATAAA TTACATATAA TTTTTCTTCT TTAGTCATGG 4 900 

AGAACNCCCC CCCATCATCT CANCCCTATT ANCTTTCCCA TGTGTACTGG 4 950 

TATTATTAAA AAGACATTTA CATACGCAAG TTTTTCACTG ACAANCAAGA 5000 

ATGTTATTAA TGTGTAATAC TGAGCACNTT TACTTCTTAA TAAAAACTTG 5050 

ATATANT 5 057 



(5) INFORMATION FOR SEQ ID NO: 7 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 968 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: peptide 

(iii) HYPOTHETICAL: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7 

Met Val Asn Ser Ser Arg Val Gin Pro Gin Gin Pro Gly Asp Ala Lys 
15 10 15 
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Arg Pro Pro Ala Pro Arg Ala Pro Asp Pro Gly Arg Leu Met Ala Gly 
20 25 30 

Cys Ala Ala Val Gly Ala Ser Leu Ala Ala Pro Gly Gly Leu Cys Glu 
35 40 45 

Gin Arg Gly Leu Glu lie Glu Met Gin Arg lie Arg Gin Ala Ala Ala 
50 55 60 

Arg Asp Pro Pro Ala Gly Ala Ala Ala Ser Pro Ser Pro Pro Leu Ser 
65 70 75 80 

Ser Cys Ser Arg Gin Ala Trp Ser Arg Asp Asn Pro Gly Phe Glu Ala 

85 90 95 

Glu Glu Glu Glu Glu Glu Val Glu Gly Glu Glu Gly Gly Met Val Val 
100 105 110 

Glu Met Asp Val Glu Trp Arg Pro Gly Ser Arg Arg Ser Ala Ala Ser 
115 120 125 

Ser Ala Val Ser Ser Val Gly Ala Arg Ser Arg Gly Leu Gly Gly Tyr 
130 135 140 

His Gly Ala Gly His Pro Ser Gly Arg Arg Arg Arg Arg Glu Asp Gin 
145 150 155 160 

Gly Pro Pro Cys Pro Ser Pro Val Gly Gly Gly Asp Pro Leu His Arg 

165 170 175 

His Leu Pro Leu Glu Gly Gin Pro Pro Arg Val Ala Trp Ala Glu Arg 
180 185 190 

Leu Val Arg Gly Leu Arg Gly Leu Trp Gly Thr Arg Leu Met Glu Glu 
195 200 205 

Ser Ser Thr Asn Arg Glu Lys Tyr Leu Lys Ser Val Leu Arg Glu Leu 
210 215 220 



Val Thr Tyr Leu Leu Phe Leu lie Val Leu Cys lie Leu Thr Tyr Gly 
225 230 235 240 
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Met Met Ser Ser Asn Val Tyr Tyr Tyr Thr Arg Met Met Ser Gin Leu 

245 250 255 

Phe Leu Asp Thr Pro Val Ser Lys Thr Glu Lys Thr Asn Phe Lys Thr 
260 265 270 

Leu Ser Ser Met Glu Asp Phe Trp Lys Phe Thr Glu Gly Ser Leu Leu 
275 280 285 

Asp Gly Leu Tyr Trp Lys Met Gin Pro Ser Asn Gin Thr Glu Ala Asp 
290 295 300 

Asn Arg Ser Phe lie Phe Tyr Glu Asn Leu Leu Leu Gly Val Pro Arg 
305 310 315 320 

lie Arg Gin Leu Arg Val Arg Asn Gly Ser Cys Ser lie Pro Gin Asp 

325 330 335 

Leu Arg Asp Glu lie Lys Glu Cys Tyr Asp Val Tyr Ser Val Ser Ser 
340 345 350 

Glu Asp Arg Ala Pro Phe Gly Pro Arg Asn Gly Thr Ala Trp lie Tyr 
355 360 365 

Thr Ser Glu Lys Asp Leu Asn Gly Ser Ser His Trp Gly lie lie Ala 
370 375 380 

Thr Tyr Ser Gly Ala Gly Tyr Tyr Leu Asp Leu Ser Arg Thr Arg Glu 
385 390 395 400 

Glu Thr Ala Ala Gin Val Ala Ser Leu Lys Lys Asn Val Trp Leu Asp 

405 410 415 

Arg Gly Thr Arg Ala Thr Phe lie Asp Phe Ser Val Tyr Asn Ala Asn 
420 425 430 

He Asn Leu Phe Cys Val Val Arg Leu Leu Val Glu Phe Pro Ala Thr 
435 440 445 

Gly Gly Val He Pro Ser Trp Gin Phe Gin Pro Leu Lys Leu He Arg 
450 455 460 



11939-1 



-44- 

Tyr Val Thr Thr Phe Asp Phe Phe Leu Ala Ala Cys Glu lie lie Phe 
465 470 475 480 

Cys Phe Phe lie Phe Tyr Tyr Val Val Glu Glu lie Leu Glu lie Arg 

485 490 495 

lie His Lys Leu His Tyr Phe Arg Ser Phe Trp Asn Cys Leu Asp Val 
500 505 510 

Val He Val Val Leu Ser Val Val Ala He Gly He Asn He Tyr Arg 
515 520 525 

Thr Ser Asn Val Glu Val Leu Leu Gin Phe Leu Glu Asp Gin Asn Thr 
530 535 540 

Phe Pro Asn Phe Glu His Leu Ala Tyr Trp Gin He Gin Phe Asn Asn 
545 550 555 560 

He Ala Ala Val Thr Val Phe Phe Val Trp lie Lys Leu Phe Lys Phe 

565 570 575 

lie Asn Phe Asn Arg Thr Met Ser Gin Leu Ser Thr Thr Met Ser Arg 
580 585 590 

Cys Ala Lys Asp Leu Phe Gly Phe Ala He Met Phe Phe He lie Phe 
595 600 605 

Leu Ala Tyr Ala Gin Leu Ala Tyr Leu Val Phe Gly Thr Gin Val Asp 
610 615 620 

Asp Phe Ser Thr Phe Gin Glu Cys He Phe Thr Gin Phe Arg He lie 
625 630 635 640 

Leu Gly Asp lie Asn Phe Ala Glu lie Glu Glu Ala Asn Arg Val Leu 

645 650 655 

Gly Pro lie Tyr Phe Thr Thr Phe Val Phe Phe Met Phe Phe lie Leu 
660 665 670 

Leu Asn Met Phe Leu Ala He lie Asn Asp Thr Tyr Ser Glu Val Lys 
675 680 685 
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Ser Asp Leu Ala Gin Gin Lys Ala Glu Met Glu Leu Ser Asp Leu lie 
690 695 700 

Arg Lys Gly Tyr His Lys Ala Leu Val Lys Leu Lys Leu Lys Lys Asn 
705 710 715 720 

Thr Val Asp Asp lie Ser Glu Ser Leu Arg Gin Gly Gly Gly Lys Leu 

725 730 735 

Asn Phe Asp Glu Leu Arg Gin Asp Leu Lys Gly Lys Gly His Thr Asp 
740 745 750 

Ala Glu lie Glu Ala lie Phe Thr Lys Tyr Asp Gin Asp Gly Asp Gin 
755 760 765 

Glu Leu Thr Glu His Glu His Gin Gin Met Arg Asp Asp Leu Glu Lys 
770 775 780 

Glu Arg Glu Asp Leu Asp Leu Asp His Ser Ser Leu Pro Arg Pro Met 
785 790 795 800 

Ser Ser Arg Ser Phe Pro Arg Ser Leu Asp Asp Ser Glu Glu Asp Asp 

805 810 815 

Asp Glu Asp Ser Gly His Ser Ser Arg Arg Arg Gly Ser lie Ser Ser 
820 825 830 

Gly Val Ser Tyr Glu Glu Phe Gin Val Leu Val Arg Arg Val Asp Arg 
835 840 845 

Met Glu His Ser lie Gly Ser lie Val Ser Lys lie Asp Ala Val He 
850 855 860 

Val Lys Leu Glu He Met Glu Arg Ala Lys Leu Lys Arg Arg Glu Val 
865 870 875 880 

Leu Gly Arg Leu Leu Asp Gly Val Ala Glu Asp Glu Arg Leu Gly Arg 

885 890 895 



Asp Ser Glu He His Arg Glu Gin Met Glu Arg Leu Val Arg Glu Glu 
900 905 910 
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Leu Glu Arg Trp Glu Ser Asp Asp Ala Ala Ser Gin lie Ser His Gly 
915 920 925 

Leu Gly Thr Pro Val Gly Leu Asn Gly Gin Pro Arg Pro Arg Ser Ser 
930 935 940 

Arg Pro Ser Ser Ser Gin Ser Thr Glu Gly Met Glu Gly Ala Gly Gly 
945 950 955 960 

Asn Gly Ser Ser Asn Val His Val 

965 

(7) INFORMATION FOR SEQ ID NO : 8 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 nucleic acids 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: genomic DNA 

(iii) HYPOTHETICAL: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8 

GGGCTACCAT AAAGCTTTG 19 



(8) INFORMATION FOR SEQ ID NO: 9 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 nucleic acids 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: genomic DNA 

(iii) HYPOTHETICAL: NO 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9 



;9) INFORMATION FOR SEQ ID NO: 10 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 nucleic acids 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: genomic DNA 

(iii) HYPOTHETICAL: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10 



(10) INFORMATION FOR SEQ ID NO: 11 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 nucleic acids 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: genomic DNA 

(iii) HYPOTHETICAL: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11 



GTTCATGTTC GATCAGTTCT 



20 



GGGCTAGAAA TACTCTTATC ACC 



23 



GCCTCAAGTG TTCCACTGAT 



20 



(11) 



INFORMATION FOR SEQ ID NO: 12 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 nucleic acids 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: genomic DNA 

(iii) HYPOTHETICAL: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12 

AGGTTTTTCT GGGTAACCCT AG 22 
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